Yu Su

AGENTCL: Toward Rigorous Evaluation of Continual Learning in Language Agents

Jun 01, 2026
Via arXiv

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

Jun 01, 2026
Via arXiv

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

May 28, 2026
Via arXiv

QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks

May 22, 2026
Via arXiv

Leveraging Latent Visual Reasoning in Silence

May 18, 2026
Via arXiv

Automatic Image-Level Morphological Trait Annotation for Organismal Images

Apr 02, 2026
Via arXiv

CUBE: A Standard for Unifying Agent Benchmarks

Mar 16, 2026
Via arXiv

REMem: Reasoning with Episodic Memory in Language Agent

Feb 13, 2026
Via arXiv

Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

Feb 10, 2026
Via arXiv

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Feb 09, 2026
Via arXiv